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We present the first analysis of large-scale clustering from the Spitzer Wide- 
area InfraRed Extragalactic legacy survey: SWIRE. We compute the angular cor- 
relation function of galaxies selected to have 3.6 fim fluxes brighter than 32 /xJy 
in three fields totaling two square degrees in area. In each field we detect cluster- 
ing with a high level of significance. The amplitude and slope of the correlation 
function is consistent between the three fields and is modeled as w(6) = AO 1-1 
with A = (0.6 ± 0.3) x 10" 3 ,7 = 2.03 ± 0.10. With a fixed slope of 7 = 1.8, we 
obtain an amplitude of A = (1.7 ±0.1) x 10~ 3 . Assuming an equivalent depth of 
K fa 18.7 mag we find our errors are smaller but our results are consistent with 
existing clustering measurements in i^-band surveys and with stable clustering 
models. We estimate our median redshift z ~ 0.75 and this allows us to obtain 
an estimate of the three-dimensional correlation function £(r), for which we find 
r = 4.4±0.1^ 1 Mpc. 

Subject headings: large-scale structure of universe — infrared: galaxies — galax- 
ies: evolution — galaxies: statistics 

1. Introduction 

The Spitzer Wide-area InfraRed Extragalactic legacy survey, SWIRE (Lonsdale et al. 
2003, 2004), has recently begun observations. The survey was designed to dramatically 
enhance our understanding of galaxy evolution. We will study the history of star-formation, 
the assembly of stellar mass, the nature and impact of accretion processes in active nuclei, 
and the influence of environment on these processes at all scales. The survey will detect 
around two million galaxies at infrared wavelengths ranging from 3.6 fim to 160 /im, over 
50 square degrees. 

The analysis of the clustering of galaxies has been an important tool in cosmology for 
many years (e.g., Peebles 1980). Originally galaxies were assumed to trace the mass density 
field (modulo some "bias" factor) and their clustering was used to constrain cosmological 
models. For example, the angular correlation function of the APM galaxy survey was able to 
rule out the once standard cold dark matter model (Maddox et al. 1990). Today the cosmo- 
logical models are usually constrained by observations of the cosmic microwave background 
(e.g. Bennett et al. 2003) and conventional models of the evolution of structure under grav- 
ity can then provide us with estimates of the statistical properties of the mass density field. 
Hence studies of galaxy clustering can now be used to understand the relationship between 
galaxy formation and the mass field, i.e. the galaxy bias (e.g. Benson et al. 2001). 
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In this paper we take the first step towards an understanding of the clustering of the 
SWIRE sources, by measuring the angular correlation function of those galaxies that are 
detected at 3.6 /iin (L band) with the Infrared Array Camera (IRAC). For galaxies at 
z ~ 0.6 — 0.7 the 3.6 /iin band probes the rest-frame i^-band and so their clustering can 
be compared directly with that from local i^-band surveys such as the Two Micron All Sky 
Survey (2MASS; Mailer et al. 2003). At this wavelength, the emission is dominated by older 
stellar populations so these galaxies are tracing the sites of star-formation in the distant past 
and we are probing the bias at even earlier epochs. 

2. Catalogs and Sample Selection 
2.1. Catalogs 

The catalogs that we are using are all the ones that were available in March 2004, i.e. 
our validation "tile" in the Lockman hole region (0.4 deg 2 ) and two tiles in the ELAIS Nl 
region (each 0.8 deg 2 ). The data are available from the Spitzer Science Archive as programs 
PROGID=142 and PROGID=185 respectively. The data processing will be described in 
detail in a future paper (Surace et al. 2004). The 5a limit of the 3.6 /zm catalogs is 
5*3.6 — 3.7 /iJy (Lonsdale et al. 2004); in the analysis that follows we consider only those 
galaxies that have S3.6 > 32 /iJy, a flux-limited sample with typically very high signal-to- 
noise ratio (> 40). 

2.2. Star /galaxy Separation 

Contaminating stars will reduce the galaxy clustering signal, and so we have investigated 
two procedures for separating stars and galaxies. The first method uses our supporting 
optical data in the central 0.3 square degrees of the Lockman tile. We select objects with 
optical counterparts and detections in at least four bands (U, g', r', i', 3.6 fjm, 4.5 /mi). 
From these, we reject objects that are morphologically classified as stellar and are brighter 
than r' < 23 mag (below this limit, the star/galaxy separation becomes unreliable). We 
also reject those objects which have good optical/infrared data, but for which no galaxy or 
AGN template spectrum provides a good fit (Rowan- Robinson et al. 2004). This method 
provided us with 872 stars and 2115 galaxies, with 727 IRAC sources rejected. The stellar 
number count models of Jarrett et al. (1994) predict 620 sources, therefore we expect the 
stellar contamination in our catalog to be small. 

In the second method we use the IRAC color from 3.6 to 4.5 pm: C12 = log^^/Sks)- 
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We model this color above £3.6 > 500 fiJy as a Gaussian (assuming these to be stars, but 
fitting only to the positive half of the distribution to avoid residual contamination from 
galaxies). We find a mean C*i 2 = 0.235 and <r Cl2 = 0.014. We then apply a three-sigma cut 
and reject stars with 0.194 < C12 < 0.276. With this color criterion and the 3.6 /zm flux 
limit, the 4.5 /im completeness limit of S4.5 > 18 /xJy has no impact on the identification of 
the stars. In addition, we exclude all objects associated with stars from the 2MASS catalog 
(Cutri et al. 2003). The number of star rejected is shown in Table 1. The stellar count model 
(Jarrett et al. 1994) predicts 2074 stars per square degree, thus this method also excludes 
around 2000 galaxies per square degree with stellar colors. These are a minor fraction of our 
sample (< 20 per cent) and this will not bias our results, so long as the excluded galaxies do 
not cluster differently to the remaining sample. Since we include galaxies with both redder 
and bluer colors, this seems a reasonable assumption. 

We compared the two different methods of star/galaxy separation in the 0.3 square 
degrees of the Lockman field where both methods could be applied. In this region, the optical 
selection yielded 872 stars and the infrared selection method yielded 1100 stars. Of these, 
450 sources are common to both lists, making them the most reliable stellar identifications. 
However, these common sources are not a complete list of the stars - the stellar count model 
(Jarrett et al. 1994) predicts 620 sources in this field, leaving around 170 stars (27 per cent) 
that were not selected by both methods concurrently. Further, if we consider the 2MASS 
point source catalog to be a second reliable list of stellar identifications, then we find that 
96 of these 2MASS stars (23 per cent) were not identified as such using the optical selection 
method. Combining these statistics gives an estimate of the stellar contamination in the 
galaxy catalogs of < 5-8 per cent for the Lockman optical/IRAC catalog, and < 3 per cent 
for the three IRAC-only catalogs. 



2.3. Selection function 

We have adopted a simple but conservative selection function. Our high flux limit 
(Ss2^m > 32 /iJy, typically signal-to-noise > 40) means that even in regions of higher than 
average noise (low coverage) we will still have reasonable signal-to-noise and be well above the 
completeness level, thus our selection function is uniform. A bright star can cause artifacts 
that will affect our source detection, however, so we excluded regions around bright 2MASS 
stars, and also regions near the boundaries of the survey fields. 
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3. The angular correlation function 

We calculated the angular correlation function w(6) = AO 1 ^ 1 following the same tech- 
niques as Gonzalez-Solares et al. (2004), using the Landy and Szalay (1993) estimator. 
We applied corrections for the finite survey area (the integral constraint) and the stellar 
contamination, following the method of, for example, Roche et al. (1999). 

We calculated w(8) for each of our three independent fields: Lockman, ELAIS Nl tile 
2_2 and ELAIS Nl tile 3_2. We have made a seperate measurement in the center of the 
Lockman field with deep optical coverage, using a different star/galaxy classification as 
described in Section 2.2. Each of these correlation functions are plotted in Figure 1 and 
tabulated in Table 1. We note that, as with any angular correlation analysis, the data 
points are not independent. We fit the model to the data over 9 > 0.003 deg (11") - 
on smaller scales the correlation function clearly departs from a power law, indicating an 
excess of close pairs relative to the clustering on larger scales. This excess is most-likely 
due to interacting/merging galaxies (Roche et al. 1999), but our large source detection 
aperture (6") limits our ability to investigate this so we will explore this in more detail in 
a future paper. The strength of clustering in the Lockman optical field appears to higher 
than the other fields; this might be because the additional optical selection criteria reduce the 
effective depth of the field and shallower surveys will always have stronger angular clustering. 
The best-fitting parameters to the three combined samples (excluding the Lockman optical 
dataset) are: A = (1.7 ± 0.1) x 10~ 3 for a fixed 7 = 1.8; and A = (0.6 ± 0.3) x 10~ 3 with 
7 = 2.03 ±0.10 for a free fit. 



3.1. Comparison with A-band surveys 

The sources detected in the 3.6 /xm band will be similar in nature to sources detected in a 
A-band survey, as both sample the old stellar populations. In the absence of A-band data in 
our fields, we used simulated catalogs from Xu et al. (2003) to estimate the effective A-band 
limit in three different ways, giving answers ranging from A < 18.1 mag to A < 19.3 mag. 
Although this model over-predicts the 3.6 /xm number counts (Lonsdale et al. 2004) we only 
need the colours and not the overall normalisation to be correct. 

Firstly we determined the median L — A color of galaxies with S3.6 > 32 /iJy to be 
1.1; our limiting flux thus translates to A < 18.6 mag. Secondly we examined the A- 
band counts of a S3.6 > 32 /iJy simulated catalog and estimated a completeness limit of 
A < 18.1 mag (N.B. the overall normalisation of the count model does not affect this 
result). The parameter of interest is not really the A magnitude but redshift; so for our 
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third method we constructed K magnitude limited samples with varied limits and found 
that a magnitude limit of K < 19.3 gives the same median redshift as we predict for our 
sample (z = 0.75, see Section 3.2 below). Thus we define our effective i^-band limit to be 
K = 18.7 mag, recognizing that there is some uncertainty in this value by up to ±0.6 mag. 

In Figure 2 we compare our amplitude with that from various K-band surveys. Our 
errors are much smaller than existing measurements in these range but there is a very good 
agreement, the main issue is the uncertainty in how to compare the K- and L-band limits. 

Roche et al. (2003) have modeled the clustering of i^-band sources with the auto- 
correlation function evolving as 

e(r,^) = (r/r )^(l + ^)- (3+e) 

Setting the local r = 5.85/i _1 Mpc (Cabanac, de Lapparent and Hickson 2000) and using 
their own model for the redshift distribution, they estimate the angular correlation function 
as shown in Figure 2. We find good agreement with their stable clustering, e = 0, model. In- 
terestingly their model appears to over-predict the strength of the 2MASS clustering (Mailer 
et al. 2003), though these data have been plotted with the slightly shallower best-fit slopes 
7 « 1.76. 



3.2. Estimation of 3D clustering 

We can use our two-dimensional clustering measurement to infer the three-dimensional 
clustering statistics. To do this we need to know the shape of the redshift distribution 
dN/dz(z). In the absence of spectroscopic redshifts or photometric redshifts for all our 
sources we use the model of Xu et al. (2003) to estimate the redshift distribution. 

We know that this model over predicts the 3.6 /zm number counts (Lonsdale et al. 
2004), so it is important to demonstrate that this model can nevertheless predict the shape 
of the redshift distribution. In Figure 3 we compare the model with the observed redshift 
distribution from the K20 sample (Cimatti et al. 2002). The shape is a reasonable fit. 
The median redshift of the K20 sample is z mc( i = 0.74; for our model starbursts and spirals 
we find z me d = 0.92, while for the spheroids z mc d = 0.86, and the model as a whole has 
Zmed = 0.91. The median redshift of the K20 sample is biased by a sharp peak at z ~ 0.74, 
which is presumably an artifact of clustering. So although the K20 sample has a slightly 
lower median z than our model, we regard the approximate agreement between the redshift 
distributions in Figure 3 as reasonable confirmation of the model. 

Using this model we estimate that our S 3 , 6 > 32 /iJy sample has a median redshift of 
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Zmcd ~ 0.75. If the median redshift is lower than this, say by a factor of 0.8 as in the K20 
comparison above then the median redshift might be z mc d ~ 0.6. 

Using the Limber's equation (Limber 1953) inversion technique adopted by Gonzalez- 
Solares et al. (2004), which uses a redshift distribution parameterized by the median redshift, 
we estimate the real-space correlation function £(r) = (r/r ) L8 for our combined sample and 
find ro = 4.4±0.1/i _1 Mpc. The results for individual fields are shown in Table 1. Adopting 
the lower redshift, z me( j = 0.6, this drops to r = 3.3 ± O.lh^ 1 Mpc. For comparison the 
correlation function of quasars in the range 0.3 < z < 2.9 has ro = 3.99lo28/i _1 Mpc, with 
7 = 1.581q'5o (Croom et al. 2001) and hyper-luminous infrared galaxies also have a clustering 
strength similar to AGN at z ~ 0.7 (Farrah et al. 2004). The uncertainty in our correlation 
function is clearly dominated by our uncertainties in the redshift distribution and emphasizes 
the need to establish the redshift distribution of the SWIRE galaxies. 



3.3. Conclusions and future work 

We have performed the first clustering analysis on the SWIRE survey. We have a very 
strong detection of clustering with amplitude similar to i^-band surveys but with smaller 
errors. We are thus consistent with an existing phenomenological model of if-band selected 
galaxies £(r, z) = (r/r )~ 7 (1 + z)~ 3 . Physically this model could be interpreted as stable 
clustering, i.e. that galaxies have broken free from the Hubble expansion, though this seems 
implausible on large scales. However, since if-band surveys sample different galaxies at 
different redshifts the interpretation of the phenomenological model is not this straightfor- 
ward. Now that we have high quality data in the rest-frame K at high redshift we need 
phenomenological models specifically designed to interpret these. 

When we have a larger survey area available, and detailed selection functions, we will 
be able to extend the analysis to our completeness limit, which is fainter by a factor of 
nearly 10 in flux (or 2.5 magnitudes). We will then be able to subdivide our sample by flux 
and explore the evolution of clustering in much more detail. We will investigate the excess 
clustering seen on smaller angular scales, and we will also explore the clustering in the longer 
wavelength IRAC bands and the relative clustering of star-forming galaxies and passively 
evolving systems, thus gaining insights into the nature of galaxy bias. 

We are indebted to Nathan Roche for supplying us with machine readable versions of 
his clustering models. We would like to thank the referee, Dr. Ari Mailer, for very useful and 
constructive comments. This work was supported by PPARC grant PPA/G/S/2000/00508 
(SJO, IW, EGS). Support for this work, part of the Spitzer Space Telescope Legacy Science 
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Program, was provided by NASA through an award issued by the Jet Propulsion Laboratory, 
California Institute of Technology under NASA contract 1407. This publication makes use of 
data products from the Two Micron All Sky Survey (http:/ /www. ipac.caltech.edu/2mass/), 
which is a joint project of the University of Massachusetts and the Infrared Processing and 
Analysis Center /California Institute of Technology, funded by the National Aeronautics and 
Space Administration and the National Science Foundation. 

Facilities: Spitzer Space Telescope. 

Link to datasets used in this analysis. 
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Table 1. Parameterized w(9) estimated from each of our sub samples. 



Sample 


Area 


Wgals 


Altars 


Ml = 1-8) 


A 


7 


n>(z mc d = 0.75) 


n>(zmcd = 0.6) 




[sq. deg.] 






[io- 3 ] 


[io- 3 ] 




[ft-'Mpc] 


[fc^Mpc] 


Lockman (optical) 


0.3 


2115 


872 


2.7 ±0.5 


1.2 ±1.3 


1.97 ±0.25 


5.5 ± 0.5 


4.3 ±0.5 


Lockman full tile 


0.4 


3685 


1551 


2.0 ±0.3 


1.0 ±1.0 


1.95 ± 0.23 


4.6 ± 0.4 


3.7 ± 0.3 


ELAIS Nl tilc.2.2 


0.8 


8677 


4163 


1.7 ±0.2 


0.5 ±0.3 


2.06 ± 0.14 


4.2 ± 0.3 


3.3 ±0.3 


ELAIS Nl tilc.3.2 


0.8 


8472 


4272 


1.7 ±0.2 


0.9 ±0.6 


1.95 ± 0.16 


4.2 ± 0.3 


3.3 ±0.3 


Combined sample 


2.0 


20834 


9986 


1.7 ±0.1 


0.6 ±0.3 


2.03 ± 0.10 


4.4 ±0.1 


3.3 ±0.1 



Column 3 is the number of galaxies included in each sample, and column 4 is the number of stars that were rejected. w(8) is 
modeled as w{9) = AB 1 ^ 7 , with 8 measured in degrees. Column 5 has 7 = 1.8 fixed, while columns 6 and 7 are the results for 
a free fit. Columns 8 and 9 are estimates of the strength of the correlation function £(r) = (r/ro) assuming a median 
rcdshift of z = 0.75 and z = 0.6 respectively. The Lockman (optical) sample is a subset of the Lockman full tile and is not 

included in the combined sample. 
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Fig. I. — The angular correlation function w(9) in each of our fields. The best-fitting power 
law with fixed slope is shown as a solid line. This power law is the same for both ELAIS Nl 
tiles, w(6) = 0.0017 (#/deg)" as , and this is plotted as a dotted line in the other panels for 
reference. 
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Fig. 2. — Amplitude A of the angular correlation function w(8) as a function of i^-band 
magnitude. We have plotted our survey at an effective K = 18.7 mag. Existing data comes 
from Baugh et al. (1996); Roche et al. (1998, 1999); Daddi et al. (2000); McCraken et 
al. (2000); Kummel and Wagner (2000); Roche et al. (2002, 2003); Mailer et al. (2003). 
All data have fixed 7 = 1.8 except for those of Mailer et al. (2003) who found 7 ~ 1.76, 
depending on scale. The lines are the evolving models of Roche et al. (2003), with stable 
(e = 0), co-moving e = —1.2 and intermediate e = —0.4 clustering. An expanded view of 
our new measurement is shown in the inset. 
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Fig. 3. — The redshift distribution from the K20 sample (Cimatti et al. 2002) compared 
with the model of Xu et al. (2003). The model starbursts are shown in blue (middle curve), 
spheroids in red (lower curve) and total counts in green (upper curve). We have normalized 
the total counts in the model to the total number of sources in the K20 sample (dividing by 
a factor of two). 




